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Abstract. Procedures based on current methods to de- 
tect sources in X-ray images are applied to simulated 
XMM-Newton images. All significant instrumental effects 
are taken into account, and two kinds of sources are con- 
sidered - unresolved sources represented by the telescope 
PSF and extended ones represented by a /3-profile model. 
Different sets of test cases with controlled and realistic 
input configurations are constructed in order to analyze 
the influence of confusion on the source analysis and also 
to choose the best methods and strategies to resolve the 
difficulties. 

In the general case of point-like and extended objects 
the mixed approach of multiresolution (wavelet) filtering 
and subsequent detection by SExtractor gives the best 
results. In ideal cases of isolated sources, flux errors are 
within 15-20%. The maximum likelihood technique out- 
performs the others for point-like sources when the PSF 
model used in the fit is the same as in the images. How- 
ever, the number of spurious detections is quite large. 

The classification using the half-light radius and 
SExtractor stellarity index is succesful in more than 98% 
of the cases. This suggests that average luminosity clusters 
of galaxies {L[2-io]keV 3 x 10^'' erg/s) can be detected 
at redshifts greater than 1.5 for moderate exposure times 
in the energy band below 5 keV, provided that there is no 
confusion or blending by nearby sources. 

We find also that with the best current available pack- 
ages, confusion and completeness problems start to appear 
at fluxes around 6 x 10~^^ erg/s/cm^ in [0.5-2] keV band 
for XMM-Newton deep surveys. 

Key words: Methods: data analysis, Techniques: image 
processing. X-rays: general 



1. Introduction 

X-ray astronomy has entered a new era now that Chandra 
and XMM-Newton are in orbit. Their high sensitivities 
and unprecedented image qualities bear great promises 



but also pose new challenges. In this paper, we outline 
problems of object detection in X-ray images that were 
not previously encountered. In doing so, we compare the 
performances of various detection techniques on simulated 
XMM-Newton test images, incorporating the main instru- 
mental characteristics. 

The X-ray observations consist of counting incoming 
photons one by one, recording their time of arrival, po- 
sition and energy. Later, the event list is used to create 
images for a given pixel scale and energy band. Various X- 
ray telescope effects complicate this simple picture - the 
point spread function (PSF) and the telescope effective 
area (the vignetting effect), both dependent on the off- 
axis angle and incoming photon energy; detector effects 
like quantum efficiency variations, different zones not ex- 
posed to X-ray photons; environmental and background 
effects like solar flares and particle background. Even for 
relatively large exposures, the X-ray images could contain 
very few photons, and some sources could contain only a 
few tens of photons spread over a large area. Consequently, 
it is important for the source detection and characteriza- 
tion procedures to be able to cope with these difficulties. 

As an example, the same hypothetical input situation 
is shown schematically in Fig. |l| for ROSAT [], XMM- 
Newton ^ and Chandra ^ . XMM-Newton 's rather large 
PSF, coupled with its higher sensitivity, leads to the detec- 
tion of more objects but also to blending and source con- 
fusion, which become severe for long exposures depending 
on the energy band. Confusion problems in the hard band 
above 5 keV are less important, given the smaller number 
of objects and smaller count rate of energetic photons. 
Thus, why we concentrate our analysis mainly on source 
detection problems for the more complicated case of the 
XMM-Newton energy bands below 5 keV. 

Each X-ray mission provides data analysis packages - 
EXSAS for ROSAT (Zimmermann et al. |1994D , CIAO for 
Chandra (Dobrzycki et al. |1999[) and XMM-Newton Sci- 
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Table 1. The general parameters for the simulated im- 
ages. 

Parameter 

Image scale 4"/pixel 
Image size 512x512 
Exposure time lOks & lOOks 
Energy bands [0.5-2] & [2-10] keV 
PSF on axis 6" (FWHM) 
15" (HEW) 
Total background (pn-|-2MOS) 
[0.5 - 2] keV 1.78 x lO"'^ cts/s/pixel 

[2 - 10] koV 



0.0041 cts/s/arcmin^ 
2.4 X 10"^ cts/s/pixel 
0.0055 cts/s/arcmin^ 



1 5" 



Fig. 1. Typical representation of objects seen by ROSAT 
-HRI, XMM-Newton -EPIC and Chandra -HRMA. The 
objects are represented by 5-functions and folded by 
the corresponding instrumental PSF and efficiency. FuU- 
width-at-half-maximum (FWHM) of the on-axis PSFs are 
1'.'7 for ROSAT -RRl, O'.'S for Chandra -HRMA and 6" for 
XMM-Newton -EPIC. The dotted line represents schemat- 
ically the detection limit. 

ence Analysis System (XMM-SASQ). They include proce- 
dures for source detection, and in this paper we estimate 
and compare their performances on simulated images us- 
ing various types of objects. These procedures make use 
of techniques such as Maximum Likelihood (ML), Wavelet 
Transformation (WT), Voronoi Tessellation and Percola- 
tion (VTP). 

In Section || we describe the X-ray image simulations. 
A short presentation of the detection procedures is given 
m Sec. |. Tests using only point sources are presented in 
Sec. |[ and extended sources in Sec. ||. We have analyzed 
realistic simulations of a shallow and a deep extragalactic 
field with only point sources in Sec. ^ and with extended 
objects in Sec. ^for an exposure of 10 ks. Finally, we inves- 
tigate the problems of confusion and completeness in two 
energy bands - [0.5-2] and [2-10] keV for two exposures - 
10 ks and 100 ks (Sec. ||). Sec. || presents the conclusions. 
(Hq — 50 km/s/Mpc, h = 0.5, qo = 0.5 and JIq = 1 are 
used). 

2. Simulation of X-ray images 

The simulations are essential to understand and qualify 
the behavior of the different detection and analysis pack- 
ages. We have developed a simulation program that gener- 
ates X-ray images for given exposure times with extended 
and point-like objects. It takes into account the main in- 
strumental characteristics of XMM-Newton and the total 



sensitivity of the three EPIC instruments. The procedure 
is fast and flexible and is made of two independent simula- 
tion tasks: object generation (positions, fluxes, properties) 
and instrumental effects. A possibility to apply the instru- 
mental response directly over images is also implemented, 
especially useful when one wants to use sky predictions 



from numerical simulations (cf. Pierre et al. 2000). 

A summary of the simulated images parameters are 
given in Tab. |l} 

The point-like sources are assumed to be AGNs or 
QSOs with a power law spectrum with a photon index 
of 2 and flux distribution following t he log TV — log S re- 
lations of Hasinger et al. ( 1998 , 2001 ) and Giacconi et al. 
( 2000| ) in the two energy bands. 

The PSF model is derived from the current available 
calibration file^. On-board PSF data is generally in very 
good agreement with the previous ground based calibra- 
tions (Aschenbach et al. 200C). We must stress that the 
model PSF is an azimuthal average and in reality, espe- 
cially for large off-axis angles, its shape can be very dis- 



torted. However, in the analytic model (Erd et al. 2000), 
the off-axis and energy dependences are not available yet. 
This is not crucial, as the energy dependence in the bands 
used is moderate and we conflne all the analysis inside 10' 
from the centre of the field-of-view where the PSF blurring 
is negligible. 

The extended object s are m odeled by a /3-profile (Cav- 
aliere & Fusco-Femiano 1976 ) with fixed core radius rc — 
250 kpc and /3 — 0.75 . A thermal plasma spectrum 
(Raymond & Smith 1977) is assumed for different tem- 



peratures, luminosities and redshifts. 

The source spectra (extended and point-like) are 
folded with the spectral response function for the total 
sensitivity of the three XMM-Newton EPIC instruments 
(MOSl, M0S2 and p n with thin fiffers) by means of 
XSPEC (Arnaud 1996) to produce the count rates in dif- 



ferent energy bands. The actual choice of the energy bands 
is not important for this comparison study, although some 
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Table 2. Count rates for extended sources in three en- 
ergy bands calculated assuming an average luminosity 
iL[2-io]keV - 2.8 X 10^4 erg/s, = 5 keV) /3-profile 
cluster of galaxies with a Raymond-Smith spectrum, 
Nh = 5 X 10^° cm~^ and spectral response functions for 
the three EPIC detectors with thin filters. 



2 


Core radius 




Count-rate, photons/s 




arcsec 


[0.4-4] 


keV [0.5-2] keV 


[2-10] keV 


0.6 


32.8 


0.1687 


0.1316 


0.0362 


0.7 


31.3 


0.1238 


0.0963 


0.0253 


0.8 


30.4 


0.0942 


0.0734 


0.0185 


0.9 


29.7 


0.0737 


0.0577 


0.0139 


1.0 


29.3 


0.0593 


0.0465 


0.0107 


1.1 


29.1 


0.0486 


0.0382 


0.0085 


1.2 


29.0 


0.0406 


0.0319 


0.0068 


1.3 


29.0 


0.0343 


0.0270 


0.0055 


1.4 


29.1 


0.0293 


0.0231 


0.0046 


1.5 


29.2 


0.0253 


0.0200 


0.0038 


1.6 


29.4 


0.0220 


0.0175 


0.0032 


1.7 


29.6 


0.0193 


0.0154 


0.0027 


1.8 


29.9 


0.0171 


0.0137 


0.0023 


1.9 


30.2 


0.0152 


0.0122 


0.0020 


2.0 


30.5 


0.0137 


0.0109 


0.0018 



objects can be more efficiently detected in particular en- 
ergy ranges. 

As an example, we show in Tab. ^ the resulting count 
rates for extended sources assuming that they represent 
an average cluster of galaxies. 

The background in the simulations includes realis- 
tic cosmic diffuse and background values derived from 
the XMM-Newton in-orbit measurements in the Lockman 



Hole (Watson et al. |200lD . 

The calculated count rates for the objects and the pho- 
tons of the background are subject to the vignetting effect 
- some photons are lost due to the smaller telescope effec- 
tive area at given off-axis angle, depending on the incom- 
ing photon's energy. We have parametrized the vignetting 
factor - the probability that a photon at an off-axis an- 
gle 9 to be observed - as polynomials of fourth order in 
two energy bands: [0.5-2] and [2-10] keV, using the latest 
XMM-Newton on-flight calibration data. For example, a 
photon at 6* = 10' has a 53% chance of being observed in 
[0.5-2] keV and 48% in [2-10] keV. 

Further instrumental effects such as quantum efh- 
ciency difference between the CCD chips, the gaps be- 
tween the chips, out-of-time events, variable background, 
pile-up of the bright sources are not taken into account - 
their inclusion is not relevant for our main objective. 

3. Detection procedures 

Without attempting to provide a review of the avail- 
able techniques in the literature, we briefly describe here 
the procedures we have tested. They are summarized in 
Tab. 0. 



3.1. Sliding cell detection and maximum likelihood (ML) 
method 

Historically, the sliding cell detection method was first 
used for Einstein Observatory observations (e.g. EMSS - 
Gioia et al. 1990). It is included in ROSAT , Chandra and 
XMM-Newton data analysis tools and a good description 
can be found in the specific documentation for each of 
those missions. 

The X-ray image is scanned by a sliding square box 
and if the signal-to-noise of the source centered in the box 
is greater than the specified threshold value it is marked 
as an object. The signal is derived from the pixel values 
inside the cell and noise is estimated from the neighboring 
pixels. Secondly, the objects and some zone around them 
are removed from the image forming the so-called "cheese" 
image which is interpolated later by a suitable function 
(generally a spline) to create a smooth background image. 
The original image is scanned again but this time using a 
threshold from the estimated background inside the run- 
ning cell to give the map detection object list. 

The procedure is fast and robust and does not rely on a 
priori assumptions. However it has difficulties, especially 
in detecting extended features, close objects and sources 
near the detection limit. Many refinements are now imple- 
mented improving the sliding cell method: (1) consecutive 
runs with increasing cell size, (2) matched filter detection 
cell where the cell size depends on the off- axis angle. How- 
ever, the most important improvement was the addition 
of the maximum likelihood (ML) technique to further an- 
alyze the detected sources. 

The ML technique was first applied to analyze ROSAT 
observations (Cruddace et al. 1988, 1991, Hasinger et al. 



1993). It was used to produce all general X-ray surveys 
from ROSAT mission (e.g. R ASS - Voges et al. |1999 , 
WARPS survey - Ebeling et al. |2000| ). The two lists from 
local and map detection passes can be merged to form 
the input objects list for the ML pass. It is useful to 
feed the ML procedure with as many candidate objects 
as possible, having in mind that large numbers of objects 
could be very CPU-expensive. The spatial distribution of 
an input source is compared to the PSF model - the like- 
lihood that both distributions are the same - is calcu- 
lated with varying the input source parameters (position, 
extent, counts) and the corresponding confidence limits 
can be naturally computed. A multi-PSF fit is also im- 
plemented which helps in deblending and reconstructing 
the parameters of close sources. In the output list, only 
sources with a likelihood above a threshold are kept. 

The ML method performs well and has many valu- 
able features, however, it has some drawbacks - it needs a 
PSF model to perform the likelihood calculation and thus 
favours point-like source analysis, extent likelihood could 
be reliably taken only for bright sources, it cannot detect 
objects which are not already presented in the input list 
(e.g. missing detections in the local or map passes). 



I. Valtchanov et al.: Source detection for XMM-Newton 



Table 3. Designation, implementation and short description of the procedures or method used for detection and 
analysis. 



Procedure 


Implementation 


Version 


Method 


EMLDETECT 


XMM-SAS v5.0 


3.7.2 


Cell detection + Maximum likelihood 


VTPDETECT 


Chandra CIAO 


2.0.2 


Voronoi Tessellation and percolation 


WAVDETECT 


Chandra CIAO 


2.0.2 


Wavelet 


EWAVELET 


XMM-SAS v5.0 


2.4 


Wavelet 


G+SE 


Gauss + SExtractor 


2.1.6 


Mixed - gauss convolution followed by 
SExtractor detection 


MR/l+SE 


MR/1 + SExtractor 


2.1.6 


Mixed - multi-resolution filtering followed 
by SExtractor detection 



Here we have used EMLDETECT - an implementation of 
the m ethod specifically adapted for XMM-SAS (Brunner 



1996 ). In the map mode sliding cell pass we used a low 
signal-to-noise ratio 3cr) above the background in or- 
der to have as many as possible input objects for the ML 
pass. The likelihood limit (given by L = — InP, where P is 
the probability of finding an excess above the background) 
was taken to be 10, which corresponds roughly to Aa de- 
tection. A multi-PSF fitting mode with the maximum of 
6 simultaneous PSF profile fits was used. 



3.2. VTP 



VTP - the Voronoi Tessellation and Percolation method 



(Ebeling & Wiedenmann 1993, Ebeling 1993) is a general 
method for detecting structures in a distribution of points 
(photons in our case) by choosing regions with enhanced 
surface density with respect to an underlying distribution 
(Poissonian in X-ray images). It treats the raw photon dis- 
tribution directly without any recourse to a PSF model or 
a geometrical shape of the objects it finds. Each photon 
defines a centre of a polygon in the Voronoi tessellation 
image and the surface brightness is simply the inverse area 
of the polygon (assuming one single photon per cell) . The 
distribution function of the inverse areas of all photons 
is compared to that expected from a Poisson distribution 
and all the cells above a given threshold are fiagged and 
percolated, i.e. connected to form an object. This method 
was successfully used with ROSAT data (Scharf et al. 



1997) and is currently incorporated in the Chandra DE- 



TECT package (Dobrzycki et al. 1999) 



Apart from these advantages, VTP has some draw- 
backs which are especially important for XMM-Newton 
observations: (1) because of the telescope's high sensitiv- 
ity and rather large PSF with strong tails, the percolation 
procedure tends to link nearby objects; (2) excessive CPU 
time for images with relatively large number of photons; 
(3) there is no simple way to estimate the extension of 
objects. 



3.3. Wavelet detection procedures 

In the past few years a new approach has been extensively 
used: the wavelet technique (WT). This method consists 
in convolving an image with a wavelet function: 



X y 

Wa{x, y) = I{x, y) (g) ■0(-, -), 



(1) 



where Wa are the wavelet coefficients corresponding to a 
scale a, I(x, y) is the input image and ip is the wavelet 
function. The wavelet function must have zero normaliza- 
tion and satisfy a simple scaling relation 



i^a{x,y) = -)■ 
a a a 



(2) 



The choice of ■0 is dictated by the nature of the prob- 
lem but most often the second derivative of the Gaussian 
function or the so called "Mexican hat" is used. 

The WT procedure consists of decomposing the orig- 
inal image into a given number of wavelet coefficient im- 
ages, Wa, within the chosen set of scales a. In each wavelet 
image, features with characteristic sizes close to the scale 
are magnified and the problem is to mark the signifi- 
cant ones, i.e. those which are not due to noise. In most 
cases, this selection of significant wavelet coefficients can- 
not be performed analytically because of the redundancy 
of the WT introducing cross-correlation between pixels. 
For Gaussian white noise, Wa are distributed normally, al- 
lowing easy thresholding. This is not the case for X-ray 
images which are in the Poissonian photon noise regime. 

Various techniques were developed for selecting the sig- 
nificant wavelet coefficients in X-ray images. In Vikhlinin 



et al. (1997) a local Gaussian noise was assumed; Slezak 
et al. (1994) used the Ascombe transformation to trans- 
form an image with Poissonian noise into an image with 
Gaussian noise; in Slezak et al. ( |1993| ), Starck & Pierre 



(1998) a histogram of the wavelet function is used. In re- 
cent years a technique based on Monte Carlo simulations 



is used successfully (e.g. Grebenev et al. 1995, Damiani et 
al. [1997| , Lazzati et al. |199g| ). 



Once the significant coefficients at each scale are cho- 
sen, the local maxima at all scales are collected and cross- 
identified to define objects. Different characteristics, such 
as centroids, light distribution etc., can be computed, as 
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well as an indication of the source size at the scale where 
the object wavelet coefficient is maximal. 

WT has many advantages - the multiresolution ap- 
proach is well suited both for point-like and extended 
sources but favours circularly symmetric ones. Because of 
the properties of the wavelet function a smoothly varying 
background is automatically removed. Extensive descrip- 
tion of wavelet transform and its different applications can 
be found in Starck et al. (1998). 



(1) Gaussian convolution. For our simulated images we 
applied two convolutions with FWHM=12" and 20" 
constrained by the image characteristics (see Sec. ||). 

(2) Multiresolution iterative threshold filtering (MR/1 
package, Starck et al. 1998D by means of "a trous" 



In this work we have tested two WT procedures: 

WAVDETECT: one of the Chandra wavelet-based detection 
techniques (Freeman et al. 1996, Dobrzycki et al. 



1999). It uses a "Mexican hat" as wavelet function and 



identifies the significant coefficients by Monte Carlo 
simulations. The background level needed to empiri- 
cally estimate the significance is taken directly from 
the negative annulus of the wavelet function. Source 
properties are computed inside the detection cell de- 
fined by minimizing the function | log rpsF — logCFl, 
where rpsF is the size of the PSF encircling a given 
fraction of the total PSF flux and ap is the size of 
the object at the scale closest to the PSF size. No de- 
tailed PSF shape information is needed to perform this 
minimization - just rpsF as a function of the off-axis 
angle. The control parameters which need attention 
are the significance threshold and the set of scales. 
We have used a significance threshold of 10^^ corre- 
sponding to 1 false event in 10000 (~ 4(t in Gaus- 
sian case) and "\/2 sequence" for the scales, where 
a= 1,V2,2,2%/2,4,3V2...16. 
EWAVELET: XMM-SAS package, based on a wavelet anal- 
ysis. It is very similar to WAVDETECT but implements 
some new ideas. The sources are assumed to have a 
Gaussian shape in order to analytically derive their 
counts and extent. Currently, the PSF information is 
ignored, which can be regarded as a serious drawback. 
We have used the significance threshold of 10"'* (~ 4cr) 
and wavelet scales 1, 2, 4, 8 and 16. 



3.4- Mixed approach 

This method combines a source detection by an elaborated 
procedure over filtered/smoothed raw photon images. 

The use of such a mixed approach is motivated by the 
fact that procedures for source detection in astronomical 
images have been developed for many years and the steps 
and problems of deblending, photometry, classification of 
objects are now quite well understood. The raw photon 
image manipulations can be performed with very simple 
smoothing procedures (for example a Gaussian convolu- 
tion) or with more sophisticated methods like the "match- 
ing filter" technique, adaptive smoothing or multiresolu- 
tion (wavelet) filtering. 

We have used two different types of raw image filtering: 



(with holes) wavelet method and Poissonian noise 
model (Slezak et al. [1993^ Starck & Pierre |199SD to flag 
the significant wavelet coefficients (also known as auto- 
convolution or wavelet function histogram method). 
The control parameters are the significance threshold, 
wavelet scales and the number of iterations for the im- 
age reconstruction. We took 10~^ significance thresh- 
old (corresponding to ~ 4cr for Gaussian distribution), 
6 scales for the "a trous" method allowing analysis 
of structures with characteristic sizes up to 2^ pixels 
(~ 4'). We have used 25 iterations or a reconstruc- 
tion error less than 10~^. More iterations can improve 
the photometry but can also lead to the appearance of 
very faint real or sometimes false features. Optionally 
the last-scale wavelet smoothed image in the iterative 
restoration process is used for analysis of large-scale 
background variations or very extended features. 

Source detection over the filtered images was per- 
formed with SExtractor (Bertin & Arnouts 1996| ) - one 
of the most widely used procedures for object detection in 
astronomy with a simple interface and very fast execution 
time. Originally, SExtractor was developed for optical 
images but it can be applied very successfully on X-ray 
images after they are properly filtered. The flexibility of 
SExtractor is ensured by a large number of control pa- 
rameters requiring dedicated adjustments. In the following 
we will briefly outline those having a significant influence 
on our results: 

— Detection and analysis threshold - important param- 
eters for Gaussian convolved images. It is preferable 
to use a value greater than 4(t, even sacrificing very 
faint sources but reducing the number of false detec- 
tions. For MR/1 filtering this is irrelevant because the 
features in the images are already at 4cr significance. 

— Background mesh size - influences object detection as 
well as photometry. Using too coarse a background grid 
(less grid points) will lead to a smoother background 
image but some faint objects will be missed in the de- 
tection. While too detailed a map (more grid points) 
will lead to many false detections and worse photome- 
try, depending on the local noise properties. We have 
found after many experiments that local background 
meshes with 32 to 64 nodes give best results balancing 
both effects. 

— Minimum detection area - in some cases this param- 
eter can help to avoid spurious detections especially 
when one uses wavelet scales below the characteristic 
size of objects. 

— Deblending parameters - the number of the deblending 
levels and the minimum contrast. We used the extreme 
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values of 64 levels and zero contrast so that any saddle 
point will lead to object splitting. 
Photometry - we have decided to use the auto- 
matic aperture photometry procedure implemented in 



Table 4. Test 1. Detection results, 
input objects is 36. 



The total number of 



SExtractor following Kron ( [1980D and Infante ( [1987D 
which gives better results than the fixed aperture or 
isophotal methods. 
— Classification - we discuss the classification in the next 
subsection. 

3.5. Classification 

The problem of classifying sources as resolved or unre- 
solved (or stars/galaxies, extended/point-like) has a long 
history and discussions can be found in widespread detec- 
tion package s like FOCAS (Valdes et al. |199C| ), INV ENTOR Y 
(Kruszewski |l988| ), SExtractor (Bert in & Arnouts [l99^ , 
Neural Extractor (Andreon et al. 2000). This task becomes 
even more difficult for X-ray images - we have already 
mentioned various X-ray telescope effects that blur and 
distort the images as a function of the off-axis angle. 

EMLDETECT - the extension likelihood is calculated by the 
deviation of the object fitted from the PSF model when 
varying the size (extension) of the object. It was used 
with iiOiS^T observations to flag possible extended ob- 
jects and in most of the cases the results were positive. 
The extension likelihood depends on the chosen PSF 
model and for faint objects is not reliable (De Grandi 
et al. p^97| ). 

WAVDETECT - the ratio between object size and the PSF 
size {Rpsf) is used for classification purposes. It de- 
pends on the adopted definition for object size and can 
be misleading, especially for faint objects. 

SExtractor - one parameter for classification is the stel- 
larity index. It is based on neural-network training per- 
formed over set of some 10^ simulated optical images. 
There are 10 parameters included in the neural net- 
work training - 8 isophotal areas, the maximal inten- 
sity and the "seeing" . The only controllable parameter 
is the "seeing" which, however, has no meaning for 
space observations. It can be tuned to the mean PSF 
size. 

The half-light radius R^o is another classification pa- 
rameter. It can be used as a robust discriminator be- 
cause it depends only on the photon distribution inside 
the object ("luminosity profile"), which is assumed to 
be different for resolved and unresolved objects. 

Additional indications might be taken into account when 
the classification is ambiguous - e.g. the wavelet scale at 
which object's wavelet coefficient is maximal or even spec- 
tral signatures. 



4. Test 1 point-like sources 



Method 


Missed 


False 


EMLDETECT 


4 


13 


G+SE (4a) 


6 


1 


MR/l+SE 


7 


1 


WAVDETECT 


7 


21 


EWAVELET 


6 


4 


VTPDETECT 


12 


19 



4-.1. Input configuration 

We address the problem of point-like sources separated by 
15" (half-energy width of the on-axis PSF), 30" and 60" 
with different flux ratios. We include the PSF model and 
background but do not apply the vignetting effect. 

The raw input test image is shown in Fig. || together 
with its Gaussian convolution, MR/ 1 wavelet filtering and 
WAVDETECT output image. Visually, the Gaussian image is 
quite noisy, while there are few spurious detections in the 
WT images. 

4.2. Detection rate and positional errors 

The number of missing detections and false objects are 
shown in Tab. ^. 

The one sigma input-detect position differences are 
less than the FWHM of the PSF (6") for all procedures 
and the maximum occurs for the blended objects, as ex- 
pected. Note the large number of spurious detections with 
WAVDETECT, VTPDETECT and EMLDETECT. 

4.3. Photometric accuracy 

The results for the photometry in terms of the inferred to 
the input counts arc shown in Fig. ^. 

Sr = 15". Only EMLDETECT detects two of the six fainter 
objects. None of the other procedures separates the ob- 
jects and consequently the inferred counts are a blend 
from both sources. 

5r — 30". The proximity of objects influences the detec- 
tion and the photometry. EMLDETECT and EWAVELET 
show the best detection rate results while all the other 
procedures miss one of the faintest objects. 

5r = 60". We can safely assume that the objects are well 
separated. The recovery of the properties is informa- 
tive for the performance of the tested procedures. It is 
clear that the general flux reconstruction error (taken 
to be the spread of the points around the unity line 
in Fig. H) is about 15% for brighter sources and goes 
down to 20-25% for the faintest ones. [In our 10 ks ex- 
posure tests and with the adopted background in band 
[0.5-2] keV, we assume the objects with input counts of 
20 photons (~ 10~^^ erg/s/cm^) to be at the detection 
limit when there is no confusion by nearby sources.] 
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Fig. 2. Test 1. The raw X-ray photon image with object counts for 10 ks exposure time (upper left). There are three 
sets of two columns of objects at separations of 15" (HEW of the PSF), 30" and 60". The Gaussian convolution with 
FWHM= 12" (upper right), MR/1 wavelet filtering (lower left) and WAVDETECT detection image (lower right) both 
with a significance threshold of 10~^ are shown. 



4-.4- Discussion 

After this simple test we can eliminate the VTPDETECT: 
in addition to the very large execution time, some of the 
VTP-detected object centres were shifted by more than 
20" from their input positions - a consequence of its abil- 
ity to detect sources with different shapes where the ob- 
ject center can be far from the input position. Moreover, 
VTPDETECT percolates all the double sources into single 
objects at 5r — 30", which all other procedures were able 
to separate. 

No procedure unambiguously shows best results - both 
in terms of the detection rate, spurious sources and photo- 
metric reconstruction. EMLDETECT outperforms the others 
in terms of detection rate but with the price of many spu- 
rious detections. Using exactly the same PSF model as 



the one hard-coded in EMLDETECT leads to much better 
photometric reconstruction. 

All other procedures are comparable: EWAVELET show- 
ing better detection but its photometric reconstruction is 
far from satisfactory - about half of the photons were lost 
at Sr = 30" and 60", because of the assumed Gaussian 
shape used to derive analytically extension and counts. 
We have applied a simple correction for the object size 
to arrive at the good photometric results for EWAVELET 
presented on Fig. ^ 

5. Test 2 point-like plus extended objects 

5.1. Input configuration 

This test is similar to Test 1 , but we have replaced some of 
the point-like sources by extended ones generated as de- 
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Fig. 3. Test 1. The three panels of each figure show the results in terms of the ratio of inferred counts SCTS{out) and 
input counts SCTS{in) as a function of the varying input counts for the three cases of object separations (indicated 
by 5r). The mean and st.dev. of the corresponding points are also indicated. When there are no detections, the mean 
and the st.dev. are both zero. Objects with input counts fixed at 100 (squares) are placed beside their corresponding 
neighbors (circles), rather than being plotted at 100. 
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Fig. 4. Test 2. The raw X-ray photon image for 10 ks exposure time (upper left). As in Fig. ^ three cases of 
separations are indicated, as well as the corresponding input source counts. The extended objects are in the right 
columns. The Gaussian convolution with FWHM= 20" (upper right), MR/1 WT filtered image (lower left) and 
WAVDETECT reconstructed image both with a significance threshold of 10~* (lower right) are shown. 



scribed in Sec. y. The raw photon image with input counts 
indicated and its representations are shown in Fig. |^. 

5.2. Detection rate and positional errors 

The number of missed and false detections are shown in 
Tab. |. An increase of the searching radius to 20" was 
needed: at Sr = 15" the blending tends to shift the cen- 
troid towards the point-like source. Note that this situ- 
ation is a clear case for source confusion: if we take the 
closest neighbour (the point-like source in some cases) as 
the cross-identification from the input list, we shall over- 
estimate the flux more than two-fold, while the true rep- 
resentation is the extended object. 

Some changes were needed for the procedures not 
based on the wavelet technique in order to avoid splitting 
of the bright extended objects into sub-objects: increase of 



the Gaussian convolution FWHM to 20", and multi-PSF 
fit for EMLDETECT. In the Gaussian case, the larger smooth- 
ing length smears some of the point-like sources, leading to 
non-detection. EMLDETECT splitting persists even with the 
maximum number of the PSFs fitted to the photon dis- 
tribution (currently it is capable of simultaneously fitting 
up to 6 PSFs). 

5.3. Photometric reconstruction 

The inferred-to-input source counts ratio 
SCTS{out)/SCTS{in) is shown in Fig. |. We will 
not consider EWAVELET as its photometric results and 
detection of extended sources were quite unsatisfactory. 
The simple correction technique (as in Test 1) based on 
the PSF does not work - the extended objects profile has 
different shape from the PSF. 
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Fig. 5. Test 2. As in Fig. |^ except that the squares now represent extended objects and the point-hke sources at 
fixed counts of 100 (circles) are put beside their corresponding neighbors (rather than being put at 100), Circles with 
arrows and numbers denote the ratio when it is above 2, 



Table 5. Test 2 resuhs for the detection rate. The first 
number in the "Missed" column is for point sources while 
the second is for extended ones. The total number of input 
sources is 24=12+12, 



Method 


Missed 


False 


EMLDETECT 


1+1 


89 


G+SE 4a 


6+0 


1 


MR/l+SE 


6+0 


6 


WAVDETECT 


6+0 


18 


EWAVELET 


4+2 


5 



6r = 15", Only MR/l+SE gives good results for the flux 
restoration of the extended objects, but overestimat- 
ing the counts for the faintest one, WAVDETECT misses 
almost half of the input photons while EMLDETECT split- 
ting leads to very poor results. 



Sr = 30". All procedures give bad restoration results 
with MR/l+SE performing best again for the extended 
sources. The proximity of the objects leads to an over- 
estimation of the point-like source counts and an un- 
derestimation of the extended object counts. There is 
no simple way to correct for this effect, but can be done 
using a rather elaborated iterative procedure involving 
extended object profile fitting, 

Sr — 60", Point-like source results are relatively similar 
with all procedures - the source counts are slightly 
overestimated due to the extended object halo even at 
60". The problems of WAVDETECT and EMLDETECT and 
the recovery of the extended objects counts are quite 
obvious. Again MR/l+SE is the best performing pro- 
cedure with extended objects flux uncertainty about 
25-30%. 
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5.4- Object classification 

An important test is the ability of the procedures to clas- 
sify objects and to allow further analysis of complicated 
cases of blending. The MR/l+SEresults for the clasifica- 
tion by means of the half-light radius and stellarity index 
(c.f. Sec. ^) are shown in Fig. ^, overlayed over the results 
from 10 simulated images with only point-like sources. 
Clearly the detected extended objects with MR/l+SE fall 
into zones not occupied by point-like sources. Note, how- 
ever, that the two detected point-like sources at 30" will 
be mis-clasified as extended objects - the proximity not 
only influences the source counts, overestimated by more 
than 4-5 times (Fig. ||), but also the object profile and 
consequently the classification. 

Fig. shows the WAVDETECTclassification - the ratio 
of the object size to the PSF size (Rpsf)- The results 
are more ambiguous with WAVDETECT (Fig. |^) compared to 
MR/l+SE. The results with EMLDETECT and its classification 
parameter (extension likelihood) were very unsatisfactory 
due to the extended object splitting. More comprehensive 
discussion of the simulations and the classification is left 
for Sec. 0. 

5.5. Discussion 

Clearly EMLDETECT and WAVDETECT have problems in 
restoring the fluxes of extended objects. We have already 
discussed the splitting difficulties of EMLDETECT. The ex- 
planation for WAVDETECT's bad results is that the wavelet 
scale at which the detected object size is closer to the PSF 
size defines the source detection cell (in which the flux is 
computed). If the characteristic size of an object is larger 
than the PSF size (i.e. an extended object) this procedure 
will tend to underestimate the flux. 

We can safely accept the MR/l+SE procedure as the 
best performing for detection and characterization both 
for point-like and extended objects. We must stress how- 
ever that one cannot rely on the flux measurements when 
there are extended and point-like sources separated by less 
than 30". The proximity affects also the classification of 
the point-like sources. Using the classification and then 
performing more complicated analysis like profile fitting 
and integration for the extended sources can improve a 
lot the restoration. In realistic situations we can expect 
very often problems of this kind, especially with XMM- 
Newton . 

6. Test 3 — Realistic model vifith only point-like 
sources 

6.1. Input configuration 

We simulate an extragalactic field including only point-like 
sources with flu xes drawn from the log — log 5* relation 
(Hasinger et al. |l998| , |200li Giacconi et al. ^000| ). PSF, vi- 
gnetting and background models are applied as described 



Table 6. One sigma positional error and number of de- 
tected objects inside the inner 10' from the center of the 
FOV and more than 100 counts for a 10 ks exposure. 



Procedure 


Ar" 


number 


EMLDETECT 


2.9 


13 


G+SE 


3.5 


14 


MR/l+SE 


3.2 


13 


WAVDETECT 


4.1 


12 



in Sec. The aim is to test the detection procedures in 
more realistic cases where confusion and blending effects 
are important and not controlled. The raw photon image 
is shown in Fig. |^ together with its visual representation 
- the same input configuration for a much larger exposure 
time and no background, only keeping the objects with 
counts greater than 10. It displays better the input object 
sizes, fluxes and positions and it is instructive to compare 
it to the MR/1 flltered and WAVDETECT images shown on 
the same figure. 

6.2. Cross-identification and positional error 

We need to define a searching radius in order to cross- 
identify the output and the input lists. The input list con- 
tains many objects with counts well below the detection 
limit (log N — log S extends to very faint fiuxes) and a 
lower limit must be chosen. For each detected object, we 
search for the nearest neighbour inside a circle within the 
reduced input list. 

The positional difference for the brightest detected 
sources (more than 100 counts) in the inner 10' from the 
center of the FOV is shown in Tab. ^. The region beyond 
10' is subject to serious problems caused by the vignetting 
and PSF blurring, the detected object centroid can be few 
PSFs widths from the true input identification. 

We therefore adopt the following cross-identification 
parameters: the input list is constrained to counts greater 
than 10; a 6" searching radius; we consider only the central 
10' of the FOV. 

6.3. Detection rate and photometric reconstruction 

The detection rate and flux reconstruction results are 
shown in Fig. ^ There are different effects playing a role 
in the distribution and the numbers of missed and false 
detections: 

(1) "false" detections - non-existent objects, or two or 
more sources blended into a single detected object. The 
result will be a "false" detection if the blended objecs 
are not in the input list (count limit) or the merged 
object centroid is beyond the searching radius. 

(2) source confusion - in the cross-identiflcation process 
the nearest neighbour to the detected source is not the 
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Fig. 6. Test 2. MR/l+SE detection classification based on R^q (left panels) and stellarity (right panels) as a function 
of the off-axis angle (upper panels) and detected source counts (lower panels) . Identified extended (filled circles) and 
point-like objects (triangles) are plotted over the results from 10 simulated images with only point-like sources (see 
Sec. I). 



cff-oxis [orcrnin] 



Fig. 7. Test 2. WAVDETECT classification based on object size to PSF size ratio - Rpsp- The identified extended (filled 
circles) and point-like sources (triangles) are plotted over the results from 10 simulated images with only point-like 
sources (see Sec. the dashed line marks a ratio of unity . 



true assignment; or as in case (1), when a blend of 
objects is wrongly identified by one input source. 
(3) missed detections - depending on the local noise prop- 
erties, some objects can be missed even if their input 
counts are above the adopted limiting counts for cross- 
identification. 

6.4- Discussion 

The results in terms of detection rate are similar for all 
procedures. The best detection rate shows G+SE but at 
the price of twice as many false detections. 



The photometry reconstruction for the sources above 
50 counts shows a spread about 10-15% for the WT based 
methods and - 25% for EMLDETECT. However, EMLDETECT 
clearly outperforms the other procedures when we use 
the same PSF model as the one hard-coded into the pro- 
gramme. This fact shows that using a correct PSF repre- 
sentation has a crucial importance for the ML technique. 
More discussion about the detection limits, completeness 
and confusion is left for Sec. 
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Fig. 8. Test 3. A simulated XMM-Newton extragalactic field with only point-like sources for lOks exposure time and 
the total sensitivity of the three EPIC instruments (upper left), its representation for much larger exposure time and 
no background (upper right). The MR/l+SE filtering (lower left) and WAVDETECT images both with 10"'* significance 
threshold (lower right) are shown. 



7. Test 4 sky models with point-like and 
extended objects 

7.1. Input configuration 

We have chosen exactly the same input point-like sources 
configuration as in Test 3 and we have added 5 extended 
objects. They may be regarded as clusters of galaxies at 
different redshifts with the same /3-profiles and moder- 
ate X-ray luminosity L[2-io]keV ~ 3 x lO**^ erg/s (c.f. 
Tab. 1^). Now the objective is to estimate the capabilities 
of the procedures to detect and identify extended objects. 
The difference from Test 2 is the arbitrary positions of 
the point-like sources leading to different local and global 
background properties and to uncontrolled confusion ef- 
fects. 



The input configuration and wavelet filtered and out- 
put images are given in Fig. ^ 



7.2. Positional and photometric reconstruction 

Results for the point-like sources will not be presented, 
because the input configuration (position, log N — log S, 
background) is exactly the same as in the previous test. 
It was shown in Test 2 that the presence of a point 
source even with moderate counts in the vicinity of a faint 
extended source could lead to confusion and even non- 
detection. Of course, the presence of a cluster will affect 
the detection and photometry of the point-like objects in 
its vicinity, but this effect is of minor concern for this test 
and has been already discussed (Test 2). 
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Fig. 9. Test 3. Recovery of the input flux (upper panel of each figure). The continuous hne is exact match between 
detected and input counts while the dashed lines are for two times differences. The limit of 50 input counts is shown 
by a vertical dotted line and the mean and the st.dev. (in brackets) of SCT S{out) / SCT S{in) for the two regions are 
indicated. The black circles denote objects with detect-input position difference larger than 4", suggesting blending 
effects; the grey circles denote objects with more than one neighbour inside the searching radius. In the bottom 
panels, the corresponding rate and distribution of input counts (continuous histogram), missed input objects (dashed 
histogram) and possible false detections (dotted histogram) are shown. 



The detection rate, input-detect position offsets, de- 
tected counts and detected-to-input counts ratio are 
shown in Tab. 0. 

As to the positional errors, it was already concluded 
that the centres of the extended object can be displaced 
by more than the adopted searching radius for point-like 
sources (Sec. The differences in positions shown in 
Tab. 1^, especially for fainter objects, are 3-4 times larger 
than the one sigma limit for point-sources inside the inner 
10' of the FOV (Tab. |). 

It is confirmed again that EMLDETECT and WAVDETECT 
are not quite successful in charactering extended objects. 
But note that this time the results for MR/l+SE and G+SE 



are worse than the results in Test 2 - the rate of lost 
photons being about 20-30%. Also, the flux of the clusters 
at z — 1.5 and 2 is overestimated, suggesting blending 
with faint nearby point-like sources. 

7.3. Classification 

To classify the detected objects we have performed many 
simulations with only point-like sources as in Test 3. Ten 
simulated images were generated with the same log N — 
log S and background, but with different and arbitrary po- 
sitions of the input sources. Exactly the same parameters 
were used for filtering, detection and characterization. 
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Fig. 10. Test 4. The raw X-ray photon image with point-hke and extended objects for lOks exposure time (upper left) 
and its visual representation for much larger exposure and no background (upper right). The corresponding redshifts 
for the clusters are indicated. The MR/l+SE filtering (lower left) and WAVDETECT both with 10~^ significance threshold 
(lower right) are shown. 



Two classification parameters were used: the half-light 
radius (i?5o) and the stellarity index from SExtractor 
based procedures. The results are shown in Fig. We 
do not show results with WAVDETECT and EMLDETECT clas- 
sification parameters because their unsatisfactory results 
were confirmed, as in Test 2. 

We can see the excellent classification based on the 
stellarity index and half-light radius: in the inner 10', for 
objects with more than 20 detected counts, stellarity less 
than 0.1 and R50 > 20" we have 15 incorrect assignments 
from 1287 detections (- 1%). 



8. Test 5 completeness and confusion 

In this section we investigate the confusion and complete- 
ness problems for XMM-Newton shallow and deep obser- 



vations like the first XMM Deep X-ray Survey in the Lock- 



man Hole (Hasinger et al. 2001). 

A set of 10 images with exposure times of 10 ks and 
100 ks in the energy bands [0.5-2] and [2-10] keV were gen- 
erated; the fiuxes were drawn using the latest log TV — log S 
relations from Hasinger et al. (2001) and Giacconi et al. 
( 2000| ). Detection and analysis were performed with ex- 



actly the same parameters for all simulations: detection 
threshold, analysis threshold, background map size, detec- 
tion likelihood, etc. (see Sec. ||). Cross-identification was 
achieved using the input sources above 10 counts and 30 
counts for 10 ks and 100 ks exposures respectively. Lower- 
ing the count limits yields more cross-identifications but 
increases considerably the number of spurious detections. 

The input image for 100 ks and [0.5-2] keV band is 
shown in Fig. |l^. The inner 10' zone where all analysis is 
performed is indicated, as well as the total XMM-Newton 
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Fig. 11. Test 4. Half-light radius -R50 (left figures) and stellarity index (right) as a function of the off-axis angle (up) and 
detected counts (down) for 10 generations. The detection was performed with SExtractor after MR/ 1 multiresolution 
filtering. There are in total 1287 detections indicated by points. The extended objects from this test are shown with 
filled circles. 



Fig. 12. Simulated lOOks XMM-Newton deep field in [0.5- 2] keV with th e sam e parameters {Nh, logiV - log 5, T, 
background) as in the Lockman Hole (Hasinger et al. 2001 , Watson et al. 2001). We restrict the analysis to the inner 
10'. 



field-of-view. It is informative to compare it with the im- 
ages for 10 ks in Fig. ||. 

In order to estimate the effect of confusion we have 
generated images with only point-like sources, distributed 
on a grid such to avoid close neighbours, and with fixed 
fluxes spanning the interval [10~^^, 10~^^] erg/s/cm'^. 

In the following we discuss some important points. 

— Confusion and completeness 

The detection rate of the input sources as a function 
of flux (Fig. |l^) indicates that confusion problems are 
significant for 100 ks exposures in the [0.5-2] keV band. 



They are marginal for 100 ks exposures in [2-10] keV 

band and absent for 10 ks in both bands. 

Energy conversion factors and the limiting fluxes below 

which more than 10% of the input objects are lost are 

shown in Tab. |[ 

Differential flux distribution 

The differential flux distributions for 100 ks in [0.5-2] 
keV obtained by MR/l+SE and EMLDETECT are shown on 
Fig. |lj. Spurious detections appears to be numerous 
with EMLDETECT below 5 x lO"^*" erg/s/cm^ and tend 
to compensate the loss of sensitivity and confusion. 
MR/l+SE appears to be less affected and allows us to set 
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Fig. 13. Ratio of number of cross-identified objects to the number of input objects, with counts greater than 10 and 
30 for 10 ks and 100 ks exposures respectively, as a function of the input flux. The results of 10 generations are shown 
by crosses, the heavy histogram is the corresponding average; the curve indicates the detection rate if confusion is 
absent and the dotted line marks 90% completeness. 



a conservative flux limit of 6 x 10^^^ crg/s/cm^(~ 90 
photons for 100 ks), below which the incompleteness 
becomes important - 65% of the input sources are lost 
between 3 x 10~^^ and 6 x 10~^^ erg/s/cm^. 
log - log S* 

The logA^ - logS" functions in [0.5-2] keV for 10 ks 
and 100 ks exposures are shown in Fig. |l^. The in- 
ferred logiV — logS* (by simple source counting) are 
in very good agreement with the input ones down 
to fluxes about 2 x 10"-^^ and 6 x 10~^^ erg/s/cm^ 
with MR/l+SEand 10~^^ and 3 x lO"^*" erg/s/cm^ for 
EMLDETECTfor 10 ks and 100 ks respectively. Although 
there are confusion and completeness problems for 
100 ks starting at 5 — 6 x 10~^^ erg/s/cm^as dis- 
cussed above, their effect is completely masked in the 
EMLDETECT integral distribution, which seems to be in 



very good agreement down to 3 x 10^^® crg/s/cm^(the 
lower flux limit for the Lockman Hole Deep Survey 
analysis of Hasinger et al. 2001). 
Photometric accuracy 

The photometric reconstruction is relatively similar for 
lOks and lOOks in for fluxes greater than 2 x 10~^^ and 
6 X 10~^^ erg/s/cm'^ respectively (the 90% complete- 
ness limit. Fig. p^ ). The flux uncertainties are about 
25-30 % and going down to fainter fluxes leads to very 
poor photometry. 



9. Conclusions 

Various procedures for detecting and characterizing 
sources were tested by simulated X-ray images. We have 
concentrated our attention mainly on images with XMM- 
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Fig. 14. Differential number counts as a function of flux. The continuous histogram is the input distribution, the 

dots with error bars (Poissonian) arc the measured distribution (without any cross-identifications) and the dashed 
histogram indicates the false detections - all are averaged from 10 simulations. Left for MR/l+SE, right for EMLDETECT. 




Fig. 15. The integral number of objects in the inner 10' (surface 0.087 sq.deg.) as a function of the flux for [0.5-2] 
keV and two exposures: 10 ks and 100 ks. The points with error bars (Poissonian) are the detections (without cross- 
identiflcation) with MR/l+SE boxes are EMLDETECTresults, while the histogram is the input log A'' — log 5 function. 



Newton speciflc characteristics, because the problems aris- 
ing from its high sensitivity and relatively large PSF are 

new and challenging. 

We have analyzed the detection rate and the recovery 
of all characteristics of the input objects: flux, positional 
accuracy, extent measurements and the recovery of the 
input logiV — log 5 relation. We have also investigated 
confusion problems in large exposures. 

Concerning detection rate and characteristics recon- 
struction, we have shown that the VTPDETECT implemen- 
tation of the Voronoi Tessellation and Percolation method 
is not suited to XMM-NeAuton images. EWAVELET provides 
very good detection rate and photometric reconstruction 



for point-like sources after a simple correction, but shows 
unreliable results for extended sources. 

One of \\ic best methods for point- like source detection 
and flux measurements is EMLDETECT but we stress again 
that the PSF model used for the ML procedure needs to 
be close to the image PSF for the most accurate photom- 
etry. Serious drawbacks are the relatively large number of 
spurious detections as well as the splitting of the extended 
sources, which we were not able to suppress even with 6 
simultaneous PSF profile fits in the multi-PSF mode; this 
seriously hampers the analysis of the extended sources. 

WAVDETECT is a flexible method giving good detections 
even in some complicated cases. But, here again, spurious 
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Fig. 16. Photometry reconstruction for all 10 simulated images at 10 ks (left) and 100 ks (right) in the [0.5-2] 
keV band. The solid line is exact match between detected and input counts while the dashed lines are for two-fold 
differences. The vertical dashed line marks the 90% completeness limit (see Tab. ||) and mean and st.dev. (in brackets) 
above this limit are denoted. 



Table 7. 

objects. 



Recovery of position and flux of the extended 



redshift 


Ar 


Input Detect 


Detect /Input 




[arcsec] 


[counts] [counts] 


[%] 






EMLDETECT 




0.6 


2.1 


1316 94 


7 


1.0 


8.0 


465 12 


3 


1.5 


12.3 


200 161 


81 


1.8 


4.6 


136 228 


167 


2.0 


14.8 


109 32 


29 






G+SE 




0.6 


1.2 


1316 1043 


79 


1.0 


5.2 


465 355 


76 


1.5 


1.9 


200 220 


110 


1.8 




Not detected 




2.0 


15.3 


109 80 


73 






MR/l+SE 




0.6 


0.2 


1316 1016 


77 


1.0 


2.3 


465 340 


73 


1.5 


1.8 


200 223 


111 


1.8 


11.8 


136 83 


61 


2.0 


10.8 


109 185 


169 






WAVDETECT 




0.6 


5.8 


1316 344 


26 


1.0 


10.6 


465 193 


41 


1.5 


0.1 


200 39 


19 


1.8 




Not detected 




2.0 


15.3 


109 27 


24 



detections are quite numerous. WAVDETECT does not as- 
sume a PSF model but requires the PSF size as a function 
of the encircled energy fraction and the off-axis distance 
in order to define the object detection cell. However, the 



Table 8. The energy conversion factors (ECF) and 
the 90% completeness limits for detections. ECF is com- 
puted assuming power-low spectrum with photon index 
F = 2, Nh = 5x 10^^ cm-2 (Hasinger et al. ^00l\) and 



the three EPIC instruments (pn-|-2M0S) with thin filters. 



[0.5-2] keV [2-10] keV 
ECF (cts/s per erg/s/cm^) 



6.70 X 10"'^ 3.66 X 10"^^ 
Flux limits (erg/s/cm^) 
lOks 2 X 10"^^ 10"" 
looks 6 X lO"^*^ 3 X 10"^^ 



way the detection cell is defined leads to bad photometry 
for extended objects. 

Our choice is the MR/l+SE method. The mixed ap- 
proach involving first a multircsolution iterative thresh- 
old filtering of the raw image followed by detection and 
analysis with SExtractor. Our tests have shown that this 
is the best strategy for detecting and characterizing both 
point-like and extended objects . Even though this mixed 
approach consists of two distinct steps, it is one of the 
fastest procedures (Tab. H), allowing easy checks of differ- 
ent stages in the analysis (filtering, detection, photome- 
try). 

Without blending or confusion effects, the photometry 
is accurate within 10-20% for both point-like and extended 
objects. This uncertainty can be regarded as an intrinsic 
error due to the Poissonian nature of the X-ray images. 
For extended objects, only the MR/l+SE method gives sat- 
isfactory photometric results. 

Blending between extended and point-like sources is 
quite serious at separations below ~ 30". Better results 
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Table 9. CPU times for performing Test 5 for lOOks. 
The procedures were run on a Pentium Ill-Xeon, 550MHz, 
Linux. G+SE and MR/l+SE CPU time is the total of fil- 
tering and detection passes. For EMLDETECT the number of 
input objects is given. 



Procedure Number of CPU time 





detections 


[min.] 


EMLDETECT 


528 


12.0 


EWAVELET 


364 


0.4 


MR/l+SE 


370 


1.9 


G+SE 


365 


0.1 


WAVDETECT 


378 


10.3 


VTPDETECT 


1307 


10.7 



the MR/1 software, R. Ogley and A. Refregier for valuable com- 
ments on the manuscript, H. Bruner and J. Ballet for comments 
and help on XMM-SAS and EMLDETECT, E. Bertin for help on 
SExtractor. We thank also the referee for valuable comments 
and suggestions on the manuscript. 
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